Working towards a Metadata Federation of CLARIN and DARIAH-DE
Abstract
Over the past years, great effort has gone into establishing research infrastructures for the Humanities and Social Sciences. Because their targeted research fields and communities are so diverse, various infrastructure projects have developed their own solutions for overlapping target groups. This holds especially for describing available resources with structured metadata and providing them to a wider audience in a user-friendly fashion. This paper reports on recent work to bridge the gap between the metadata infrastructures of CLARIN and the German branch of the DARIAH project, covering the design decisions made by both projects and preliminary work towards their synergetic evolution.
Related Resources
CLARIAH in the Netherlands
I introduce CLARIAH in the Netherlands, which aims to contribute the Netherlands part of a Europe-wide humanities research infrastructure. I describe the digital turn in the humanities, the background and context of CLARIAH, both nationally and internationally, its relation to the CLARIN and DARIAH infrastructures, and the rationale for joining forces between CLARIN and DARIAH in the Netherland...
Creating & Testing CLARIN Metadata Components
The CLARIN Metadata Infrastructure (CMDI) that is being developed in CLARIN (Common Language Resources and Technology Infrastructure) is a computer-supported framework that combines a flexible component approach with the explicit declaration of semantics. The goal of the Dutch CLARIN project “Creating & Testing CLARIN Metadata Components” is to create metadata components and profiles for a wide...
Facilitating Metadata Interoperability in CLARIN-DK
The issue for CLARIN archives at the metadata level is to facilitate the user’s possibility to describe their data, even with their own standard, and at the same time make these metadata meaningful for a variety of users with a variety of resource types, and ensure that the metadata are useful for search across all resources both at the national and at the European level. We see that different ...
Using TEI, CMDI and ISOcat in CLARIN-DK
This paper presents the challenges and issues encountered in the conversion of TEI header metadata into the CMDI format. The work is carried out in the Danish research infrastructure, CLARIN-DK, in order to enable the exchange of language resources nationally as well as internationally, in particular with other partners of CLARIN ERIC. The paper describes the task of converting an existing TEI ...
Features for Generic Corpus Querying
The availability of large corpora for more and more languages enforces generic querying and standard interfaces. This development is especially relevant in the context of integrated research environments like CLARIN or DARIAH. The paper focuses on several applications and implementation details on the basis of a unified corpus format, a unique POS tag set, and prepared data for word similaritie...